Cardiac ultrasomics for acute myocardial infarction risk stratification and prediction of all-cause mortality: a feasibility study

Background Current risk stratification tools for acute myocardial infarction (AMI) have limitations, particularly in predicting mortality. This study utilizes cardiac ultrasound radiomics (i.e., ultrasomics) to risk stratify AMI patients when predicting all-cause mortality. Results The study included 197 patients: (a) retrospective internal cohort (n = 155) of non-ST-elevation myocardial infarction (n = 63) and ST-elevation myocardial infarction (n = 92) patients, and (b) external cohort from the multicenter Door-To-Unload in ST-segment–elevation myocardial infarction [DTU-STEMI] Pilot Trial (n = 42). Echocardiography images of apical 2, 3, and 4-chamber were processed through an automated deep-learning pipeline to extract ultrasomic features. Unsupervised machine learning (topological data analysis) generated AMI clusters followed by a supervised classifier to generate individual predicted probabilities. Validation included assessing the incremental value of predicted probabilities over the Global Registry of Acute Coronary Events (GRACE) risk score 2.0 to predict 1-year all-cause mortality in the internal cohort and infarct size in the external cohort. Three phenogroups were identified: Cluster A (high-risk), Cluster B (intermediate-risk), and Cluster C (low-risk). Cluster A patients had decreased LV ejection fraction (P < 0.01) and global longitudinal strain (P = 0.03) and increased mortality at 1-year (log rank P = 0.05). Ultrasomics features alone (C-Index: 0.74 vs. 0.70, P = 0.04) and combined with global longitudinal strain (C-Index: 0.81 vs. 0.70, P < 0.01) increased prediction of mortality beyond the GRACE 2.0 score. In the DTU-STEMI clinical trial, Cluster A was associated with larger infarct size (> 10% LV mass, P < 0.01), compared to remaining clusters. Conclusions Ultrasomics-based phenogroup clustering, augmented by TDA and supervised machine learning, provides a novel approach for AMI risk stratification. Supplementary Information The online version contains supplementary material available at 10.1186/s44156-024-00057-w.


Background
Globally, acute myocardial infarction (AMI) affects nearly 10% of people over 60 years of age [1].In the United States, the total annual cost of AMI was $85 billion in 2016, with an estimated $40 billion lost due to premature mortality in the preceding decade [2].Unfortunately, despite the success of intervention and evolving guideline-directed treatment, AMI patients continue to have high morbidity and mortality [3].Currently, clinicians use validated risk stratification scoring systems, such as the Global Registry of Acute Coronary Events (GRACE) [4,5] and more recently the GRACE 2.0 score [6], to predict the 6-month and 1-year risk of all-cause mortality following AMI.While guidelines have recommended using the GRACE score as the most robust model for all acute coronary syndrome types [7][8][9], these scores were developed using clinical trial data long before percutaneous interventions became routine.Moreover, GRACE uses conventional statistical approaches (i.e., logistic regression) with fixed linear assumptions on data behavior and limited variables, resulting in modest discrimination (e.g., C-statistic range for predicting mortality:0.65-0.8)[5,9].
Artificial intelligence (AI) techniques have led to the development of novel methods that includes subjecting images and other inputs to sophisticated algorithms to capture complexity of human health and disease at the level of the individual [10].These methods have achieved remarkable success, especially in disease classification and risk assessments, in several image-based disciplines, such as dermatology, gastroenterology, ophthalmology, oncology, and neuroradiology [10][11][12][13][14][15][16], including the development of 'omics'-based decision support tools [17][18][19][20][21].The application of radiomics to cardiac ultrasound (i.e., ultrasomics), may aid in risk stratification of patients experiencing an AMI by extracting texturebased information from the myocardium.Moreover, the development of automated tools that integrate ultrasomics for AMI risk stratification addresses the existing gap in current guidelines which do not currently integrate cardiac imaging-based information in existing tools like GRACE 2.0 for estimating risk.
In the present study, we used a cluster-then-predict approach for AMI risk stratification.We subjected cardiac ultrasomics information to topological data analysis (TDA)-a robust method to create compressed representations of highly dimensional data to create unique patient phenogroups [22].We illustrate that the ultrasomics phenogroups can provide independent and incremental information to conventional tools like GRACE 2.0 for augmenting 1-year mortality prediction in AMI patients.Moreover, TDA can be effectively combined with machine learning and explainable AI techniques.Accordingly, we also illustrate the ability to develop robust supervised machine-learning algorithms on clustered patients, which can be applied to external data for phenogroup prediction.Since infract size is strongly associated with all-cause mortality in AMI [23], we used the Door-To-Unload in STEMI (DTU-STEMI) Pilot Trial [24] as an external, prospective, multicenter clinical trial cohort to illustrate that the high-risk phenogroup had larger infarct size as observed on cardiac magnetic resonance (CMR) imaging.

Study population
For the internal validation dataset, AMI patients were retrospectively identified from the electronic medical record of Robert Wood Johnson University Hospital who were admitted over a 6-month period between January 2023 to July 2023 (Fig. 1).The Institutional Review Board (IRB) of Robert Wood Johnson University Hospital gave ethical approval for this work (#Pro2023001660).STEMI was classified per the Joint ESC/ACCF/AHA/WHF Task Force [25].Exclusion criteria included [1] patients discharged to institutionalized care [2], type 2-5 AMI [3], co-existing terminal illness with palliative care for cancer, neurological illness (severe dementia, motor neuron disease, multiple sclerosis, Parkinson's disease, stroke, supranuclear palsy and multiple system atrophy), heart, lung, kidney or liver failure [4] alternative diagnosis for elevated cardiac troponin values (e.g.myocarditis, pericarditis, non-ischemic cardiomyopathies, moderatesevere valvular heart disease, sepsis, aortic dissection, blunt cardiac injury, coronary spasm and vasculitis, arrhythmia and cardiac arrest), and [5] pregnancy.After applying the exclusion criteria, 208 patients were initially enrolled (i.e., 87 patients classified as having a non-STelevation myocardial infarction (NSTEMI) and 121 as having a ST-elevation myocardial infarction (STEMI)).Of the 208 patients initially enrolled, 53 patients were further excluded from analysis due to technically insufficient imaging for 2 of the following 3 views: apical 4 chamber (A4C), apical 3 chamber (A3C), and apical 2 chamber (A2C).Technically insufficient imaging was classified as an inability to delineate the left ventricle (LV) endocardial boundaries on visual inspection for 2 or more segments.After excluding patients without at least two of the three apical views, 155 patients were identified for subsequent analysis (including 63 patients classified as having a NSTEMI and 92 as having a STEMI).We assessed the performance of the GRACE 2.0 score [6] with the primary outcome of all-cause mortality at one year.
For the external validation dataset, participants were recruited from a prospective, multicenter, randomized Door-To-Unload in ST-segment-elevation myocardial infarction (DTU-STEMI) pilot trial [24] (Fig. 1).We included 42 participants (all participants classified as having a STEMI) with CMR data in the current study.Infarct size on CMR was used as the primary end point.CMR-quantified infarct size was categorized as large (LGE mass accounting for > 10% of the total LV mass) or small (LGE mass accounts for ≤ 10% of the total LV mass) [26,27].The details of the CMR protocol have been previously described [24].Briefly, patients in the DTU-STEMI trial underwent standard CMR with steady-state free-precession sequence for LV ejection fraction, volumes, and mass analysis on days 3 to 5 and again on day 30 (± 7 days).For the external cohort, institutional review boards at each site approved the trial, and patients provided written, informed consent.The study was approved by the Food and Drug Administration (NCT03000270, Registration Date: 12/12/2016, Last Update: 05/06/2019).

Echocardiography image acquisition, preprocessing, and semantic segmentation
Echocardiograms from A4C, A3C, and A2C were utilized in the present studies for both the internal and external validation data analysis.Patients and participants required at least two of the three views to be present to be included in the current study (see Materials, section Study Population).2D echocardiograms were preprocessed from video formats to DICOM using Sante DICOM Viewer Pro (SanteSoft, Nicosia, Cyprus, Greece).DICOM files containing doppler data, dual ultrasound regions, or other with limited technical views were discarded.A4C, A3C, and A2C multi-beat echocardiogram DICOM files were manually selected.The LV was segmented in the A4C, A3C, and A2C views using echocv [28] (i.e., a semantic segmentation algorithm that automatically defines regions of the heart in echocardiography images through convolutional neural networks (CNNs)).
Echocv and its validation has previously been published [28], we modified echocv to be executed using Python 3.2 and leveraged TensorFlow 1.15.0 with GPU support, alongside CUDA 10.0.The segmented images were also uniformly resized to a fixed shape of 1024 by 1024 to ensure consistency across various image sources.Otherwise the use of algorithm and its validation has previously been published, specifically for predicting LV remodeling in parasternal long axis echocardiograms [29].Using the semantic segmentation algorithm, a binary mask representing the region of interest (ROI) within the A4C, A3C, and A2C views was achieved (Figure S1A).The ROI for each of the three views was then processed to obtained radiomics/ultrasomics-based information.) were retrospectively identified over a 6 month timeline who were admitted for AMI.Of these patients, 155 patients were included in the study who had at least two of three apical echocardiographic views available for analysis.Using ultrasomics features from the images, topological data analysis was used to cluster patients into three groups.These three groups were assessed in a supervised machine learning algorithm to develop class labels for the external validation group.Ultimately, groups clustered using ultrasomics features were assessed for prediction of all-cause mortality and left ventricular infarct size First-Order, shape, and texture-based feature extraction Echocardiography ultrasomics were extracted in Python (v3.7.13) using pyradiomics (v3.0.1) [30], SimpleITK (v2.2.0) [31], pywavelets (v1.3.0), and numpy (v1.21.5) for both the internal and external validation sets.We have previously published using this methodology on the LV [29].Briefly, feature extraction was performed for the 2D ROI using featureextractor() from pyradiomics.Default parameters for extraction, binwidth, resampled pixel spacing, interpolator, label definition, were applied.In total, first-order (n = 18), shape (n = 9), and texture-based (n = 73) features were extracted for each of the echocardiography views (i.e., A4C, A3C, and A2C) (Figure S1B).

TDA
The online tool TDAView [32] was used for phenogroup cluster of AMI patients in the internal validation set.Briefly, TDAView utilizes the Mapper algorithm based on TDAmapper [33].This includes user defined variables for Mapper such as: filter function, number of intervals, proportion of overlap, and number of bins in single-linkage clustering.The Mapper function allows geometric information to be converted into high dimensional point cloud data that can be interpreted by varying filters [33].Our goal with the current work was to delineate AMI patients with "high-risk" features from those with "lowrisk" features when predicting all-cause mortality.By decreasing the number of bins and the range of the lens values (i.e., intervals), we can effectively decrease the amount of oversampling and number of edges created from the resultant clusters.We used a 1D Mapper filter with distance function as Euclidean and filter function as mean.Number of intervals was defined as 10, with 5 bins.To reduce the overlap between clusters, a 5% overlap was defined.The number of clusters was not fixed.Based on the parameters used in TDAView, three clusters were generated, labeled as Cluster A (n = 62), B (n = 43), and C (n = 50).

Supervised machine learning classifier
BigML (https://bigml.com.BigML, Inc. Corvallis, Oregon, USA) was utilized for supervised machine learning and to develop a classifier for prediction of patients in Cluster A, B, and C. Weights were applied to Cluster A (weight = 1), Cluster B (weight = 1.189), and Cluster C (weight = 1.023) to address class imbalance.Through the OptiML application (i.e., a supervised machine learning algorithm that compares generated ensembles, deep neural networks, and logistic regression algorithms) 10-fold cross validation was performed and prediction of Cluster A, B, and C phenogroups was performed using only ultrasomics features.Once the supervised classifier was developed, the external validation set (n = 42 participants) was analyzed by the model to generate predicted class labels.These class labels (i.e., Cluster A, B, and C) were used for subsequent outcome prediction.

Statistics
GraphPad Prism (v10.1.1)and R (v4.1.0)were used for statistical analyses.The Shapiro-Wilk test assessed normality.In normally distributed data with continuous variables, a two-sided Student's t-test was applied.In non-Gaussian distributed data, the Mann-Whitney test was used.When assessing more than one group of continuous variables, a one-way analysis of variance (ANOVA) was applied.A Dunnett's multiple comparisons test was used for multiple comparisons in the oneway ANOVA.When assessing more than one group of categorical variables, a non-parametric Kruskal-Wallis test was applied with multiple comparisons testing.
Receiver operating characteristics (ROC) area under the curve (AUC) was created using the BigML platform, utilizing 10-fold cross validation.A Kaplan-Meier curve was generated using the R packages survival (v3.4-0) [34] and survminer (v0.4.9).Stratification of events, assessed as patients at risk for mortality at one year, was performed over 50-day increments for patients in Cluster A, Cluster B, and Cluster C. The P-value was calculated using the log-rank test in R. Using the survival package, a Cox Proportional Hazard model (CoxPH) for time-toevent analyses of mortality at one year was assessed.A risk score was generated with the (A) GRACE 2.0 score alone, (B) GRACE + Cluster A, (C) GRACE + LV global longitudinal strain, and (D) using all three variables through CoxPH regression.A probability score (i.e., ranging from 0 to 1) for predicting outcomes was generated using the predictRisk function of the riskRegression (v2022.11.28) package in R. The concordance index (C-statistic) was calculated using the pec (v2022.05.04) package in R [35].

Study overview
We evaluated patients (n = 155) presenting with NSTEMI and STEMI who had at least two of three apical echocardiographic views acquired during admission (Fig. 2A).Using echocardiography-derived ultrasomics, phenogroups were labeled through TDA and applied to the prediction of clinical outcomes, such as time-to-event mortality (Fig. 2B).A supervised machine learning algorithm was further used to characterize which ultrasomics features are important in prediction of the phenogroups and generation of risk prediction score.We then evaluated the incremental value of the phenogroups using the internal validation group and explored how assigned phenogroup labels contributed to predicting CMR findings in the external validation group (Fig. 2C).

Phenogroup Clustering through TDA
Using the online tool TDAView, three phenogroups were identified: Cluster A (n = 62), Cluster B (n = 43), and Cluster C (n = 50) (Fig. 3).Of these phenogroups, Cluster A and Cluster B are illustrated to be more homogenous in their connectivity within groups, whereas Cluster C is illustrated to represent a more heterogenous compilation of patients.Assessing the differences between these clusters, Cluster A contains more patients with a prior history of CHF (22.58% vs. 8.00%, P = 0.04), compared to Cluster C (Table 3).Further, the Cluster A phenogroup has a higher risk of all-cause mortality at 1 year (19.35% vs. 4.00%, P = 0.03), compared to Cluster C. The data in Table 2 highlight how the Cluster A represents a "highrisk" phenogroup, whereas Cluster B can be seen as "intermediate-risk" and Cluster C as "low-risk".When assessing the echocardiographic functional parameters (Table 4), Cluster A had a reduced LV ejection fraction (45% vs. 53%, P < 0.01) and LV global longitudinal strain (-11.88 vs. -13.87,P = 0.03) compared to Cluster C, respectively.

Supervised machine learning classifier for phenogroups
With only ultrasomics features, the phenogroup labels were predicted for Cluster A (ROC AUC: 0.95), Cluster B (ROC AUC: 0.95), and Cluster C (ROC AUC: 0.92) The generated probabilities from the supervised classifier were used to predict mortality and illustrate the incremental value of ultrasomics features over GRACE 2.0.The supervised classifier was applied to the external validation group to develop class labels, which were used to predict findings on cardiac magnetic resonance, including acute infarct size (Fig. 4A).When looking at the features contributing to the model, there was a mix of texture-based features and first order features (Fig. 4B).Prediction probabilities were generated for the internal validation dataset based on the supervised classifier; these probabilities were used in subsequent analyses for risk prediction.

Outcome prediction in the internal and external patient groups
Using mortality at one year, survival analysis revealed that patients assigned to Cluster A had a significant increase in mortality compared to Cluster C (log rank, P = 0.05) (Fig. 5A).We further wanted to further understand if the phenogroups, represented by changes in ultrasomics, had incremental value when predicting mortality.The concordance index was calculated for our four groups of variables: (A) GRACE 2.0 score alone, (B) GRACE + Cluster A, (C) GRACE + LV global longitudinal strain, and (D) using all three variables together (Fig. 5B).When examining GRACE scoring combined with ultrasomics (Concordance: 0.74 vs. 0.70, P = 0.04) and further adding LV GLS (Concordance: 0.81 vs. 0.70, P < 0.01), an increase in prediction of all-cause mortality is shown beyond that of the GRACE 2.0 score alone, respectively (Fig. 5C).
The developed supervised model was further applied to the external participants to assign phenogroup labels (i.e., Cluster A, B, and C).The batch prediction of the external dataset (n = 42 presenting with STEMI) labeled participants into Cluster A (n = 11), Cluster B (n = 23), and Cluster C (n = 8) (Table 5).Patients in Cluster A had a higher percentage of LV identified as "at risk" (60% vs. 37%, P = 0.04) at 5 days post AMI, compared to Cluster C.Moreover, patients in the Cluster A phenogroup had Table 1 Patient demographics of the Internal Validation Group Stratified by Acute myocardial infarction (AMI).Patients presenting with non-ST-elevation myocardial infarction (NSTEMI, n = 63) and ST-elevation myocardial infarction (STEMI, n = 92).Data are presented as the percent (%) of total or the 95% confidence interval, where applicable.Data are considered statistically significant if P ≤ 0.05, denoted by * and bolded text.BMI = body mass index, CHF = congestive heart failure, COPD = chronic obstructive pulmonary disease, CAD = coronary artery disease, CKD = chronic kidney disease, GRACE = Global Registry of Acute coronary events, MACE = major adverse cardiac events

Discussion
Properties of pathological changes within the myocardial microstructure influence ultrasound signal intensity distributions [29].Unlike information obtained indirectly (i.e., clinical risk factors, ECG, and biomarkers), specific analyzable trends in ultrasound texture information may have added insights into causal pathways that result in disease and clinical presentation.Integrating myocardial texture analysis (i.e., ultrasomics) with clinical data can provide a rich opportunity to develop machine learning models to predict adverse cardiac events following AMI, as ultrasomics can identify cellular changes in the myocardium [29,36].To this end we provide a proof-of-concept application of ultrasomics (i.e., cardiac ultrasound radiomics) in risk stratifying AMI patients.Three AMI phenogroups were identified according to ultrasound texture features with patients in phenogroup A having the worst prognosis.Phenogroup A showed incremental and independent information over GRACE 2.0 for predicting 1-year mortality after AMI.Using a clusterthen-predict framework we utilized an external hold out dataset for phenogroup prediction in which phenogroup A had large proportion of patients with moderate or large infarcts.
While classic supervised learning approaches require larger datasets, the cluster-then-predict methodology has the advantage of reducing bias, such as overfitting, when risk stratifying patients [37][38][39].Moreover this approach reduces prediction errors [40] and shows robust performance with echo-related data [41][42][43][44].Radiomics, deep learning features, 2D-echocardiography, demographic/ clinical (e.g., age, sex, race, BSA, BMI, comorbidities, family history, etc.), laboratory, and biomarker data can further be added to incrementally increase the risk-stratification of these phenogroups.Our group has previously utilized TDA to create patient similarity networks to identify aortic stenosis [45], diastolic dysfunction [46][47][48], and heart failure [49,50].In aortic stenosis, by creating patient phenogroups for mild and severe aortic stenosis, the "high-risk" severe aortic stenosis phenogroup was associated with increased risk of balloon valvuloplasty, and valve replacement [45].Specifically, as shown in this study, the phenotypic groups from TDA (or unsupervised machine learning, PCA clustering, etc.) can serve as class labels for developing supervised algorithms.This technique, first clustering and then predicting using supervised machine-learning models, can result in stronger associations with clinical outcomes by increasing the number of events (i.e., phenogroup clusters) and reduce class imbalance.
Current risk stratification tools for AMI, such as the GRACE Score, reduce mortality rates compared to standard strategies [51,52] but, with the use of current AI applications, it is possible to characterize more patients at-risk for morbidity and mortality by combining information from clinical, laboratory, imaging, and other features.Risk stratification tools can be benchmarked using AUC and C-Index as metrics, with values ranging from 0.6 to 0.7 having limited clinical value, whereas those between 0.7 and 0.8, 0.8-0.9, and > 0.9 considered to have fair, good, and excellent discrimination [53][54][55], respectively.The GRACE model has shown performances ranging from 0.65 to 0.8 (C-Index) [9], with our current study reporting a performance of 0.70, utilizing the GRACE 2.0 score, which is within the reported variation of the model.We also showed how the C-Index improved when using ultrasomics features (0.74) and in combination with LV functional parameters (0.81).As this is a feasibility study, future work should harness these non-clinical markers (such as ultrasomics and LV functional information) in larger, multicenter studies to create new risk stratification tools for the prediction of AMI.
We note several limitations to the current investigation.(1) The cohort sizes in the internal and external validations sets are relatively small (n = 155 and n = 42, respectively).While this patient groups are small, we highlight how the cluster-then-predict methodology is better adapted to smaller datasets and can help provide Table 4 Patient cardiac function of the Internal Validation Group for Predicted Ultrasomics Phenogroups.Using only the ultrasomics features from the A4C, A3C, and A2C echocardiogram views, patients were clustered into phenogroups.Cluster a "high-risk" (n = 62), cluster B "intermediate-risk" (n = 43), and cluster C "low-risk" (n = 50) using topological data analysis (TDA).Data are presented as the percent (%) of total or the 95% confidence interval, where applicable.Data are considered statistically significant if P ≤ 0.05, denoted by * and bolded text  [46][47][48][49][50] 46 [43][44][45][46][47][48][49] 45 [43][44][45][46][47] 0.17 Left Ventricular Internal Diameter -End Systole (mm) 37 [35][36][37][38][39][40] a framework for other investigations where small cohort sizes are present (i.e., rare diseases, underrepresented minorities, limited resources for data collection, etc.).Though, as the external validation cohort (n = 42) is further stratified into smaller clustered groups within our analysis, the generalizability of these results is limited and requires a larger external validation group in the future to assess the robustness of the current findings.
(2) The outcome of interest, all-cause mortality at 1 year, was only represented in 20 of 155 patients.Because of the low number of events, we used univariate analysis to screen for features to provide in the adjusted model while avoid issues with overfitting in the survival model.Nevertheless, we noted the incremental value of radiomics over conventional scores like GRACE 2.0 and several echocardiographic parameters like ejection fraction, LV end-systolic volume and global longitudinal strain.Future work with larger sample size and a greater number of events would allow develop of robust multivariable models using radiomics, clinical and conventional echocardiographic features.(3) The use of TDA, and other unsupervised learning approaches, can be subjective in the number of clusters defined.In the current study, we highlight three unique phenogroups.While we could have altered the parameters to include more or less numbers of phenogroups, the main constraint on the Mapper algorithm that we wanted to maintain was a low percent overlap between groups (i.e., reducing the similarities of phenogroups and ultimately providing clearer boundaries between those with "high" and "low" risk).

Conclusions
In summary, we utilize an echocardiography-derived approach to measure ultrasomics and identify phenogroups among patients presenting with AMI.Through TDA, three distinct phenogroups (Clusters A, B, and C) were delineated, with Cluster A representing a "highrisk" group, Cluster B an "intermediate-risk" group, and Cluster C a "low-risk" group.These phenogroups demonstrated significant differences in clinical outcomes, particularly in terms of all-cause mortality at 1 year.Logistic regression and supervised machine learning further validate the predictive power of these phenogroups, showing their potential utility in clinical risk stratification.

Fig. 1
Fig.1Recruitment Diagram.Patients (n = 208) were retrospectively identified over a 6 month timeline who were admitted for AMI.Of these patients, 155 patients were included in the study who had at least two of three apical echocardiographic views available for analysis.Using ultrasomics features from the images, topological data analysis was used to cluster patients into three groups.These three groups were assessed in a supervised machine learning algorithm to develop class labels for the external validation group.Ultimately, groups clustered using ultrasomics features were assessed for prediction of all-cause mortality and left ventricular infarct size

Fig. 2
Fig. 2 Study Design and Overview.(A) The internal validation patient cohort presenting with non-ST-elevation myocardial infarction (NSTEMI, n = 63) and ST-elevation myocardial infarction (STEMI, n = 92) (B) Ultrasomics features were extracted and TDAView was used to cluster patients into three phenogroups: Cluster A, Cluster B, and Cluster C. The identified phenogroups were used to develop class labels for the external validation group using a supervised classifier.(C) The generated probabilities from the supervised classifier were used to predict mortality and illustrate the incremental value of ultrasomics features over GRACE 2.0.The supervised classifier was applied to the external validation group to develop class labels, which were used to predict findings on cardiac magnetic resonance, including acute infarct size

Fig. 3
Fig. 3 Topological Data Analysis (TDA) Clustering of Ultrasomics Features.Individual nodes are represented as red circles, with the number next to the node corresponding to the number of patients included in the node.Cluster A (n = 62), Cluster B (n = 43), and Cluster C (n = 50) Left Ventricular Internal Diameter -End Diastole (mm) 48

Fig. 4
Fig. 4 Supervised Machine Learning Classifier.(A) Prediction of phenogroup labels on the internal validation dataset using only ultrasomics (B) The top five features contributing to model development for the supervised machine learning classifier

Fig. 5
Fig. 5 Performance of Phenogroups in Assessing All-Cause Mortality.(A) Kaplan Meyer curve and stratified risk categories for patients in phenogroups Cluster A, Cluster B, and Cluster C. (B) Time-to-event Concordance Index (C-Index) for groups (1) GRACE 2.0 score alone, (2) GRACE + Cluster A, (3) GRACE + left ventricular global longitudinal strain (GLS), and (4) using all three variables through CoxPH regression.(C) Incremental value of ultrasomics features (i.e., Cluster A) in predicting all-cause mortality, over the 1-year follow-up period.GRACE = Global Registry of Acute Coronary Events

Table 2
Patient cardiac function of the Internal Validation Group Stratified by Acute myocardial infarction (AMI).Patients presenting with non-ST-elevation myocardial infarction (NSTEMI, n = 63) and ST-elevation myocardial infarction (STEMI, n = 92).Data are presented as the percent (%) of total or the 95% confidence interval, where applicable.Data are considered statistically significant if P ≤ 0.05, denoted by * and bolded text

Table 3
Patient demographics of the Internal Validation Group for Predicted Ultrasomics Phenogroups.Using only the ultrasomics features from the A4C, A3C, and A2C echocardiogram views, patients were clustered into phenogroups.Cluster a "high-risk" (n = 62), cluster B "intermediate-risk" (n = 43), and cluster C "low-risk" (n = 50) using topological data analysis (TDA).Data are presented as the percent (%) of total or the 95% confidence interval, where applicable.Data are considered statistically significant if P ≤ 0.05, denoted by * and bolded text.BMI = body mass index, CHF = congestive heart failure, COPD = chronic obstructive pulmonary disease, CAD = coronary artery disease, CKD = chronic kidney disease, STEMI = ST-elevation myocardial infarction, GRACE = Global Registry of Acute coronary events, MACE = major adverse cardiac events * = P ≤ 0.05 when comparing all groups ** = P ≤ 0.05 when comparing Cluster A vs. Cluster C

Table 5
Patient demographics of the External Validation Group for Predicted Ultrasomics Phenogroups.Class labels were generated for the external hold out dataset (i.e., the prospective, multicenter, randomized DTU-STEMI pilot trial dataset).Labels were applied based solely on ultrasomics features from the A4C, A3C, and A2C echocardiogram views.Data are considered statistically significant if P ≤ 0.05, denoted by * and bolded text.BMI = body mass index, CHF = congestive heart failure, COPD = chronic obstructive pulmonary disease, CAD = coronary artery disease, CKD = chronic kidney disease, MACE = major adverse cardiac events, LV = left ventricular * = P ≤ 0.05 when comparing all groups ** = P ≤ 0.05 when comparing Cluster A vs. Cluster C